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Abstract: — Several experimental situations give rise to analyzing time to response on observational units (survival data) 
using split plot in time models. The general structure of such experiments is that the observation of the time of occurrence of 
an event (called a death, failure, or response) is of interest, the observational units are grouped into whole units and the 
treatments are randomized to whole units if time to the occurrence of an event T is continuous random variable then whole 
units would be considered as subsamples. If time response was grouped into intervals in the above setting, then the sufficient 
statistics in this case would be the counts of observed occurrences of an event (number of deaths failure) within intervals. 
The experiment can then be viewed as a split plot over time where time intervals periods are subunits and whole units would 
be the same as in continuous time setting, and the response variable is some function of the counts. 
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I. INTRODUCTION 

In life testing and medical follow up, observation of the time of occurrence of the event (called death, failure, or 
response) is of interest. Sometimes these occurrences may be prevented for some of the items of the sample by the 
occurrence of some other event (called loss or censoring). Kaplan and Meier (1958) assumed that the life time is independent 
of the potential loss time, and they provided, for random samples of size N, the product-limit (PL) estimate that can be 
defined as follows. List and label the N observed lifetimes (whether to death or loss) in order so that one has 0 < 1 1 < t 2 < 
... < tjv • Then P(t) = rii-tOV — r )/{N — r+ 1)], where r assumes those values for which t' r < t, and for which t' r 
measures the time to death. This is the distribution-free estimator which maximizes the likelihood function. 

Cox (1972) considered the analysis of censored failure times. He suggested a regression model for the failure time 
T of an individual when values of one or more explanatory variables were available. For T continuous, the hazard function is 
given by 

X(t,Z) = l 0 (t,Z)exp (/?'Z) , 

Which is known as proportional hazard function. It is also known as the multiplicative from of the hazard function with (3 
being the vector of the unknown parameters, and X 0 (t) is the underlying hazard function when Z=0. For T discrete, the 
logistic model was suggested. A conditional likelihood and maximum likelihood estimates were obtained. However, Cox 
(1972) proportional hazard regression model does not handle grouped survival data or large data sets with many ties (many 
individuals failed at the same time). 

Kalbfleisch and Prentice (1973) obtained a marginal likelihood for the regression parameters by restricting the 
class of models presented by Cox (1972) to those that possessed a strictly monotone survivor function or, equivalently, to 
those for which the hazard function X, 0 (t) was not identically zero over an open interval. The invariance of this restricted class 
under the group of monotone increasing transformations on T was exploited to derive a marginal likelihood function for p. If 
no ties occur their results and the results of Cox (1972) are the same with a simple justification. But if ties occur in the data 
the results obtained by Kalbfleisch and prentice (1973) are different from those suggested by Cox (1972). 

We have generalized the Cox (1972) model to include main unit variability to be able to get the split plot in time 

model. 

II. SPLIT PLOT AND VARIANCE COMPONENT LITERATURE 

Our model for survival analysis is based on using a split plot in time model, and therefore we need to consider the 
related literature. 

What we need in the variance component analysis is a method for split plot models with unequal sub-plot 
variances. We must mention here that we could not find any work in the literature that has been done for this particular 
study. However, a list and a presentation of the literature that has been done in both split plot model and variance component 
areas separately and combined will be considered. Some of the listed literature might not be of direct relation to our study, 
and some are related in the sense that they gave us an idea on the approach that we have used for variance component 
estimation. 

A common assumption in split plot experiments is that the error variance for subplot treatments are the same. 
Curnow (1957) provided tests of significance for the departure from equality of the variances for different subplot 
treatments. Also, an estimate of tie ratio of a pair of such variances was provided in this paper. For the balanced two-way 
layout split plot design, Li and Klotz (1978) compared maximum likelihood estimators and restricted maximum likelihood 
estimators with minimum variance unbiased estimators of variance components. Performance was compared in terms of man 
squared error for the three estimators. For a general mixed-effects model Brown (1978) viewed the problem of estimating 
variance components in the context of linear model theory. 



ISSN: 2278-7461 



www.ijeijournal.com 



Page II 



Survival Analysis Using Split Plot in Time Models 



The approach was to estimate the unknown vector of a parameters (3 by some vector b and thus obtain a vector of 
residuals e=Y-Xb. A vector of the squares and cross products of the residuals was then obtained, the expectation of which 
was a known linear transformation of the variance components. 

III. GROUPED TIME, MULTIPLICATIVE AND ADDITIVE HAZARD CONDITIONAL 
ON MAIN UNIT WITH NORMAL MAIN UNIT ERROR 

As presented later, the structure for design that will be considered is that we have J main units per treatment 
combination according to a CRD, riy observation units in each main unit and time to response on each observational units is 
measured. Time to response is grouped into intervals where the points defining the time intervals are denoted by 0 = t 0 < 
tj < t 2 < ■■■ < t K . The number of failures or deaths in time interval k, k=l,....,k is the number of failures or deaths in time 
interval (tk_i,tjj. 
Define 

riij : number of individuals assigned to main unit j of trt i, 
r t j k : number of individuals failed on trt i, main unit j during time interval k, 
Sijk : number of individuals survived interval k for trt i and main unit j, and 
n ijk : number of individuals at risk for trt i, main unit j and time interval k. 
For the no censoring case we have 

n ij =n ijl , and n ijk = S i; (fc_i) fork>\. 
For the censoring case we have to define C i;ft as the number censoring during the kth interval, then 

n ij l — n ij ~ c ij 1 1 an d 

n ijk = S ij(k-1) - c ijk = n ij (k-1) _ r ij (k-1) ~ c ijk f or k > 1. 
Also, define 

Pijic ■ conditional conditional probability that an individual on trt i and main unit j fails in interval k given that it survived 
k — 1 time intervals, and 

q t j k = 1 — P ijk : conditional probability of surviving interval k given survival of k-1 time intervals for an 

individual on trt i and main unit j. 

Now, let F t j (t) be the cumulative distribution function for the continuous response time random variable T for a given main 
unit j. Define S; ; - (t) = 1 — F t j (t) to be the survival function for trt i and main unit j. 
By definition, P i]k : conditional can be written as 

/failing in time interval k for an individual \ 
V on tretment i and main unit j / 



fijk 



/ surviving (k — l)time in intervals for \ 
Van individual on treatment i and main unit j/ 



. F V(h) ~ F iJ fa-i) 
1-F ij(tk-i) 



1 K 1 '^^-!)) 

1 „ ( X ~ F 0fa)) S ij(tk) f o*. 

Qijk = 1 - Pijk = -r. — ~ t = o (3-1) 

(l-FiKt^J 5 v(t t -i) 

Define the hazard function A,(t) as the limiting conditional probability of failing in an interval given surviving until 

that interval as the interval shrinks, to be 

I'r (t < T < t + Ar/T > t) /(t) 

l(t) = lim = — — , 

At^o At S(t) 

Where f(t) and S(t) are the density function and the survival function, respectively, for the continuous response time random 

variable T. Cox (1972) suggested a regression model for the failure time T of an individual when values of one or more 

explanatory variables are available. For T continuous the hazard function is of form 

l(t,z) = l 0 (t)exp((3z) 

Which is Known as the proportional hazard function, also known as a multiplicative from of the hazard function, where X, 0 (t) 
is the underlying hazard function when Z=0. (3 is the vector of unknown parameters. 

For our problem we generalize Cox's (1972) model to include the extra variability involved. In other words we will try 
to model the continuous time variable in a way related to Cox's (1972) model to include the random component £|j. 
The multiplicative hazard function for trt i and main j that will be considered is as follows 

A i; (t)=A 0 (t)e%p(^Xi(t)+ Sjj), 
Where X, 0 (t) is the underlying hazard when Xj(t)=0 and 8^=0, (3 is the vector of unknown parameters and X;(t) are the 
variables influencing failure times. Also, the survival function for trt i given main unit j is given by 
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^(t) = l-Fij(t) = exp -J^(«)du 



and therefore 



S-. (t) = exp - 1 A 0 (u) exp(y?' Xi (u) + £■■ ) du 



. (3.2) 



Substituting (3.2) in (3.1) we get 



f U 



q ijk = exp 



J 
\ 



- J^ 0 ( u ) ex P(^' x i( u ) + £ ij) du 



qi jk =exp 

and this lead to 



V l n J 
Now, let us assume that Xj(t) is constant on interval k, i.e. , let X ik = value of X;(t) on interval k. Then we have 

-expf/Tx^+eJ j\(u)du 



Log(-Logq ijk ) = Aik + Eg + Log j\(u)du. 



. (3.3) 



Let 



T k = Log 



.) du , then 



tk-1 

Log (-Log q ijk ) = 0X ik + £ i; - + T k , where /? e K p , T k E R , and 

Log (-Log q ijk ) = Log (-Log q ijk ) + , where q ijk = — , and S ijk is a random error defined by S ijk = 

n ijk 

Log (-Log q ijk ) - Log (-Log q ijk ). 

For the additive from of the hazard function we generalize the model presented by Elandt -Johnson (1980) to 
include the random component in an additive fashion. Now we drive the model that will be used later in analysis using the 
additive hazard model which is given by 

X ij (t)=X 0 (t)+ /?'xi(t) + eij 
The survivor function is then given by 

S a (t) = l-F(f) 



t 

S,. (t) = exp (- J A 0 (u) + p% (u) + ^ ) du (3 

o 

t 

= exp(-j\ j (u)du) 



.4) 



0 

Substituting (3.4) in (3.1) we get 



f U 



lijk 



exp 



ja 0 (u) + /?' Xl (u) + £ij ) 



du 



Again assume that Xi (t) is constant on interval k. In this case we have 



f 



q ljk = exp 

and this leads to 



}(X 0 (u)+^X i (u) + 8 ij ) 



du 
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log (qijk ) = $X ik (t fc _! - t k ) + E t j (t fc _! - t k ) + (- j A 0 (w) du ) (3.5) 

tk-i 

tk 

Define Z ik = X ik (t ft _! - t ft ) , £ i; - = £ i; - (t ft _! - t k ), and T k = - J A 0 (u) du. 
Then 

Log(q ijk ) = /?z ik + ^ + r fc , where /? £ J t £l 

Log (q ij7c ) = Log (q ijA . ) + , where q ijk = , and S ijk is a random error defined by S ijk = Log (q ijk ) - 
Log(q ijk ) . 

Our grouped time model given by the : 

Response = ji + a t + £y + p k + (afi) ik + S ijk 
where i=l,2, ... ,I,j=l,2, ...,J, and k= 1 , 2, . . . , K . is similar to these continuous models in the sense of having similar set 
of parameters. Therefore we can start with continuous setting for response time T and still and up with grouped time model 
that we considered for analysis although in our case response time T is discrete random variable. 

It is appropriate here to mention that the proportional hazards model is convenient, e.g., the log(-log) model is to 
be preferred over the log model for the following two reasons: 

1) Using the proportional hazards model leads to work with log(-log) model specified by the equation. 

tk 

log(-log q ijk ) = 0X ik + e i; - + log j A Q (u) du . 

t-k-l 

However, using the additive from for the hazard leads to work with log model specified by the equation 

tk 

log (qijk ) = $X ik (tfc_! - t k ) + E tj (t fc _! - t k ) + (- J ^o(") du )■ 

t k -i 

Therefore, inference with log(-log) transform is directly related to the parameters of the continuous time interpretation. The 
log(-log) model is to be preferred since (3 is invariant to time grouping. 

2) The log model has a restricted rangeq^ ' s are observed proportions and thusO < q iJk < 1 , which 

implies that log(q ijk ) < 0 
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